10. Text + Quiz: Data Types (Ordinal vs. Nominal)
Recap of Previous Video
The table below summarizes our data types. To expand on the information in the table, you can look through the text that follows.
Data Types | ||
---|---|---|
Quantitative: | Continuous | Discrete |
Height, Age, Income | Pages in a Book, Trees in Yard, Dogs at a Coffee Shop | |
Categorical: | Ordinal | Nominal |
Letter Grade, Survey Rating | Gender, Marital Status, Breakfast Items |
Below is a little more detail of the information shared in the above table.
Another Look
To break down our data types, there are two main blocks:
Quantitative and Categorical
Quantitative
can be further divided into
Continuous
or
Discrete
.
Categorical
data can be divided into
Ordinal
or
Nominal
.
You should have now mastered what types of data in the world around us falls into each of these four buckets: Discrete, Continuous, Nominal, and Ordinal. In the next sections, we will work through the numeric summaries that relate specifically to quantitative variables.
Quantitative vs. Categorical
Some of these can be a bit tricky - notice even though zip codes are a number, they aren’t really a quantitative variable. If we add two zip codes together, we do not obtain any useful information from this new value. Therefore, this is a categorical variable.
Height
,
Age
, the
Number of Pages in a Book
and
Annual Income
all take on values that we can add, subtract and perform other operations with to gain useful insight. Hence, these are
quantitative
.
Gender
,
Letter Grade
,
Breakfast Type
,
Marital Status
, and
Zip Code
can be thought of as labels for a group of items or individuals. Hence, these are
categorical
.
Continuous vs. Discrete
To consider if we have continuous or discrete data, we should see if we can split our data into smaller and smaller units. Consider time - we could measure an event in years, months, days, hours, minutes, or seconds, and even at seconds we know there are smaller units we could measure time in. Therefore, we know this data type is continuous.
Height
,
age
, and
income
are all examples of
continuous data
. Alternatively, the
number of pages in a book
,
dogs I count outside a coffee shop
, or
trees in a yard
are
discrete data
. We would not want to split our dogs in half.
Ordinal vs. Nominal
In looking at categorical variables, we found
Gender
,
Marital Status
,
Zip Code
and your
Breakfast items
are
nominal variables
where there is no order ranking associated with this type of data. Whether you ate cereal, toast, eggs, or only coffee for breakfast; there is no rank ordering associated with your breakfast.
Alternatively, the
Letter Grade
or
Survey Ratings
have a rank ordering associated with it, as
ordinal data
. If you receive an A, this is higher than an A-. An A- is ranked higher than a B+, and so on… Ordinal variables frequently occur on rating scales from very poor to very good. In many cases we turn these ordinal variables into numbers, as we can more easily analyze them, but more on this later!
Final Words
In this section, we looked at the different data types we might work with in the world around us. When we work with data in the real world, it might not be very clean - sometimes there are typos or missing values. When this is the case, simply having some expertise regarding the data and knowing the data type can assist in our ability to ‘clean’ this data. Understanding data types can also assist in our ability to build visuals to best explain the data. But more on this very soon!
Nominal vs. Ordinal
SOLUTION:
- Types of Fruit (Apple, Banana, etc.)
- Types of Dog Breeds (German Shepherd, Collie, etc.)
- Genres of Movies (Horror, Comedy, etc.)
- Gender
- Nationality